AITopics | pv 1

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Reinforcement Learning with Lookahead Information

Neural Information Processing Systems, Feb-15-2026, 22:37:38 GMT

In reinforcement learning (RL), agents sequentially interact with a changing environment, aiming to collect as much reward as possible.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Asia > Middle East > Jordan (0.04)
Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.04)

Genre: Research Report > Experimental Study (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

GPU-Accelerated Counterfactual Regret Minimization

Kim, Juho

arXiv.org Artificial Intelligence, Sep-6-2024

Counterfactual regret minimization (CFR) (Zinkevich et al., 2007) is a family of no-regret learning algorithms capable of solving large-scale imperfect-information games. We propose implementing this algorithm as a series of dense and sparse matrix and vector operations, thereby making it highly parallelizable on a graphics processing unit, at the cost of higher memory usage. Our experiments show that our implementation runs up to about 352.5 times faster than OpenSpiel's Python implementation and up to about 22.2 times faster than OpenSpiel's C++ implementation, and that the speedup becomes more pronounced as the size of the game being solved grows. CFR variants dominated the development of AI agents for large imperfect-information games such as poker (Tammelin et al., 2015; Moravčík et al., 2017; Brown & Sandholm, 2018; 2019b) and The Resistance: Avalon (Serrino et al., 2019), and were components of ReBeL (Brown et al., 2020) and Student of Games (Schmid et al., 2023).
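As a rough illustration of the kind of update CFR repeats at every decision point, the sketch below implements regret matching, the standard rule that turns cumulative regrets into a strategy. It is a plain-Python sketch for a single decision point, not the matrix-based GPU implementation described in the abstract; the function name `regret_matching` is chosen here for illustration.

```python
def regret_matching(cumulative_regrets):
    """Turn cumulative regrets into a strategy (a probability per action).

    Actions are played in proportion to their positive regret. If no action
    has positive regret, the strategy falls back to uniform play.
    """
    positive = [max(r, 0.0) for r in cumulative_regrets]
    total = sum(positive)
    n = len(cumulative_regrets)
    if total > 0.0:
        return [p / total for p in positive]
    return [1.0 / n] * n


# Two actions share all of the positive regret; the third is never played.
strategy = regret_matching([2.0, -1.0, 2.0])
```

Because the rule is the same arithmetic applied independently at each decision point, it lends itself to being batched over many decision points at once, which is the general idea behind expressing the algorithm as matrix and vector operations.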

implementation, openspiel, pvq, (16 more...)

arXiv.org Artificial Intelligence

2408.14778

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Texas (0.04)

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.34)

Extreme value statistics for censored data with heavy tails under competing risks

Worms, Julien, Worms, Rym

arXiv.org Machine Learning, Jan-19-2017

In general, the interest lies in obtaining information about the central characteristics of the underlying lifetime distribution (mean lifetime or survival probabilities, for instance), often with the objective of comparing results between different conditions under which the lifetime data are acquired. In this work, we address the problem of inference about the (upper) tail of the lifetime distribution, for data subject to both random (right) censoring and competing risks. Suppose that we are interested in the lifetimes of n individuals or items, which are subject to K different causes of death or failure, as well as to random censorship (from the right). We are particularly interested in one of these causes (this main cause will be referred to as cause number k hereafter, where $k \in \{1, \ldots, K\}$), and we suppose that all causes are mutually exclusive and are likely to be dependent on one another. The censoring time is assumed to be independent of the different causes of death or failure and of the observed lifetime itself.

artificial intelligence, estimator, pv 1, (17 more...)

arXiv.org Machine Learning

1701.05458

Genre: Research Report (0.82)

Industry: Law > Civil Rights & Constitutional Law (1.00)

Technology: Information Technology > Artificial Intelligence (0.45)

The Perturbed Variation

Harel, Maayan, Mannor, Shie

arXiv.org Machine Learning, Oct-15-2012

We introduce a new discrepancy score between two distributions that gives an indication of their similarity. While much research has been done to determine if two samples come from exactly the same distribution, much less research has considered the problem of determining if two finite samples come from similar distributions. The new score gives an intuitive interpretation of similarity; it optimally perturbs the distributions so that they best fit each other. The score is defined between distributions, and can be efficiently estimated from samples. We provide convergence bounds of the estimated score, and develop hypothesis testing procedures that test if two data sets come from similar distributions. The statistical power of these procedures is presented in simulations. We also compare the score's capacity to detect similarity with that of other known measures on real data.
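To make the idea of a perturbation-tolerant similarity score concrete, here is a minimal sketch for one-dimensional samples. It pairs up points from the two samples that lie within a tolerance `eps` of each other and reports the average fraction of points left unpaired. The matching-based formula, the restriction to one dimension, and the names `matched_pairs`, `unmatched_score`, and `eps` are all assumptions made for illustration; the abstract does not specify the estimator.

```python
def matched_pairs(xs, ys, eps):
    """Count pairs (x, y) with |x - y| <= eps, using each point at most once.

    Sorts both samples and walks them with two pointers, pairing points that
    are close enough and skipping whichever point is too small to pair.
    """
    xs, ys = sorted(xs), sorted(ys)
    i = j = count = 0
    while i < len(xs) and j < len(ys):
        if abs(xs[i] - ys[j]) <= eps:
            count += 1
            i += 1
            j += 1
        elif xs[i] < ys[j]:
            i += 1
        else:
            j += 1
    return count


def unmatched_score(xs, ys, eps):
    """Average fraction of unpaired points: 0.0 means every point found a partner."""
    m = matched_pairs(xs, ys, eps)
    return 0.5 * ((len(xs) - m) / len(xs) + (len(ys) - m) / len(ys))


# Two of the three points in each sample find a partner within eps = 0.2.
score = unmatched_score([0.0, 1.0, 2.0], [0.1, 1.1, 5.0], eps=0.2)
```

A score near 0 suggests the samples agree up to small perturbations, and a score near 1 suggests they barely overlap. Unlike a test for exact equality of distributions, this kind of score stays at 0 when one sample is a slightly shifted copy of the other.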

artificial intelligence, machine learning, similarity, (17 more...)

arXiv.org Machine Learning

1210.4006

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)
